Weak measurement takes a simple form for cumulants 



Graeme Mitchisoi|3 

Centre for Quantum Computation, DAMTP, University of Cambridge, Cambridge CBS OWA, UK 

A weak measurement on a system is made by coupling a pointer weakly to the system and then 
measuring the position of the pointer. If the initial wavefunction for the pointer is real, the mean 
displacement of the pointer is proportional to the so-called weak value of the observable being 
measured. This gives an intuitively direct way of understanding weak measurement. However, if 
the initial pointer wavefunction takes complex values, the relationship betweenpointer displacement 
and weak value is not quite so simple, as pointed out recently by R. Jozsa This is even more 
striking in the case of sequential weak measurements 01 . These are carried out by coupling several 
pointers at different stages of evolution of the system, and the relationship between the products of 
the measured pointer positions and the sequential weak values can become extremely complicated 
for an arbitrary initial pointer wavefunction. Surprisingly, all this complication vanishes when one 
calculates the cumulants of pointer positions. These are directly proportional to the cumulants of 
sequential weak values. This suggests that cumulants have a fundamental physical significance for 
weak measurement. 
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I. INTRODUCTION 

In physics, formal simplicity is often a reliable guide to the significance of a result. The concept of weak measurement, 
^ ' due to Aharonov and his coworkers [3,3], derives some of its appeal from the formal simplicity of its basic formulae. 
^ One can extend the basic concept to a sequence of weak measurements carried out at a succession of points during 
^H' the evolution of a system 2], but then the formula relating pointer positions to weak values turns out to be not 
quite so simple, particularly if one allows arbitrary initial conditions for the measuring system. I show here that the 
complications largely disappear if one takes the cumulants of expected values of pointer positions; these are related 
in a formally satisfying way to weak values, and this form is preserved under all measurement conditions. 
^f-^ ' The goal of weak measurement is to obtain information about a quantum system given both an initial state \'ipi) 
^-H \ and a final, post-selected state \ipf). Since weak measurement causes only a small disturbance to the system, the 
measurement result can reflect both the initial and final states. It can therefore give richer information than a 
\ conventional (strong) measurement, including in particular the results of all possible strong measurements [1, Q. To 
, carry out the measurement, a measuring device is coupled to the system in such a way that the system is only slightly 
perturbed; this can be achieved by having a small coupling constant g. After the interaction, the pointer's position q 
C is measured (or possibly some other pointer observable; e.g. its momentum p). Suppose that, following the standard 

von Neumann paradigm, Q, the interaction between measuring device and system is taken to be Hint = 95{t)pA, 
where p is the momentum of a pointer and the delta function indicates an impulsive interaction at time t. It can be 
shown ^ that the expectation of the pointer position, ignoring terms of order g" or higher, is 



{q)=gReA^, (1) 
where is the weak value of the observable A given by 

As can be seen, ([IJ has an appealing simplicity, relating the pointer shift directly to the weak value. However, this 
formula only holds under the rather special assumption that the initial pointer wavefunction </> is a gaussian, or, more 
generally, is real and has zero mean. When is a completely general wavefunction, i.e. is allowed to take complex 
values and have any mean value P, 0] , equation ([1]) is replaced by 

('?) = + gReA^ + glmA^ {{pq + qp)i -2{q)i{p)^), (3) 
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where, for any pointer variable x, {x)i denotes the initial expected value {(f)\x\(f)) of x; so for instance {q)i and {p)i are 
the means of the initial pointer position and momentum, respectively. (Again, this formula ignores terms of order g'^ 
or higher.) 

Equation ^ seems to have lost the simplicity of but we can rewrite it as 

{q) = {q),+gRe{^A^), (4) 

where 

e = -2i((gp),-(g),(p),), (5) 

and equation (j4|) is then closer to the form of ([T]). As will become clear, this is part of a general pattern. 

One can also weakly measure several observables, Ai, . . . , An, in succession pi]. Here one couples pointers at several 
locations and times during the evolution of the system, taking the coupling constant gk at site k to be small. One 
then measures each pointer, and takes the product of the positions qk of the pointers. For two observables, and in 
the special case where the initial pointer distributions are real and have zero mean, e.g. a gaussian, one finds 



(91-72) = ^ i?e [{A2,A,)^ + {A,UA2)^\ , (6) 

ignoring terms in higher powers of gi and g2- Here {A2, Ai)u, is the sequential weak value defined by 

{^f\WA2VA,U\^,) 
^^^^^^>-~ (^^.|WC/|^.) ' 

where C/ is a unitary taking the system from the initial state |'0i) to the first weak measurement, V describes the 
evolution between the two measurements, and W takes the system to the final state. (Note the reverse order of 
operators in {A2, Ai), which reflects the order in which they are applied.) If we drop the assumption about the special 
initial form of the pointer distribution and allow an arbitrary 0, then the counterpart of ^ becomes extremely 
complicated: see Appendix, equation lAll 

Even the comparatively simple formula ([6]) is not quite ideal. By analogy with ([T]) we would hope for a formula of 
the form ((71(72) oc Re{A2, Ai)^^,, but there is an extra term (^1)^(^2)^. What we seek, therefore, is a relationship 
that has some of the formal simplicity of ([T]) and furthermore preserves its form for all measurement conditions. It 
turns out that this is possible if we take the cumulant of the expectations of pointer positions. As we shall see in the 
next section, this is a certain sum of products of joint expectations of subsets of the qi, which we denote by {qi . . . qnY- 
For a set of observables, we can define a formally equivalent expression using sequential weak values, which we denote 
by {Am ■ . . ,Ai)1j. Then the claim is that, up to order n in the coupling constants g^ (assumed to be all of the same 
approximate order of magnitude): 

{qi... qnY = gi... guRe {^{An, . . . , Aif^} , (8) 

where ^ is a factor dependent on the initial wavefunctions for each pointer. Equation ([5]) holds for any initial pointer 
wavefunction, though different wavefunctions produce different values of ^. The remarkable thing is that all the 
complexity is packed into this one number, rather than exploding into a multiplicity of terms, as in (jAip . 

Note also that (|4|) has essentially the same form as ([8]) since, in the case n = 1, (A)'^ — However, there is an 
extra term {q)i in this arises because the cumulant for n = 1 is anomalous in that its terms do not sum to zero. 



II. CUMULANTS 



Given a collection of random variables, such as the pointer positions qi, the cumulant {qi . . . qnY is a polynomial in 
the expectations of subsets of these variables [1, [9| ; it has the property that it vanishes whenever the set of variables 
qi can be divided into two independent subsets. One can say that the cumulant, in a certain sense, picks out the 
maximal correlation involving all of the variables. 

We introduce some notation to define the cumulant. Let x be a subset of the integers {!,..., n}. We write q 

nl=i where is the size of x and the indices of the g's in the product run over all the integers x{i) in x. 
Then the cumulant is given by 

{q,...qnY= J2 «'^n(nV' (9) 

b={bi,...M j=i \ bj I 
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where b = {bi, . . . ,bk} runs over all partitions of the integers {1, . . . , n} and the coefficient Ofc is given by 

afc = (fc-l)!(-l)'=-i. (10) 

For n — 1 we have (q)'^ — {q), and for n — 2 

{<li<l2Y ^ {qiq2) - {qi){q2)- (11) 
There is an inverse operation for the cumulant 0, [l3| : 
Proposition II. 1. 

6={6i,...,6fc}i=l \ 6j / 

Proof. To see that this equation holds, we must show that the term 11^=1(116 obtained by expanding the right-hand 
side is zero unless b is the partition consisting of the single set {1, . . . , n}. Replacing each subset bj by the integer j, 
this is equivalent to ^ a^j . . . au^ = 0, where the sum is over all partitions of {1, . . . , fc} by subsets of sizes ki, . . . ,kr 
and the a^'s are given by (jlOp . In this sum we distinguish partitions with distinct integers; e.g. {1,2}, {3, 4} and 
{1, 3}, {2, 4}. There are (^^ ) {hi . . . Zfc!)~^ such distinct partitions with subset sizes ki . . . kr, where k is the number 

of A;'s equal to i, so our sum may be rewritten as fc! ■ ■ • (— l)'^''~"'^(^i! ■ • ■ ^fc'fci ■ • • fcr)~^, where the sum is 

now over partitions in the standard sense [llj. This is fc! times the coefficient of x'^ in 

l + - + ^ + ...Vl + (--V2) + ^^^ + ...Vl + (xV3) + ^^ + ...V.. (13) 



2! 7 V 2! • ---J y ■ \ / ' ■ 2! 

^ g.-.V2+^V3... = g;o3e(l+.) ^ 1 + (14) 

Thus the sum is zero except for fc = 1, which corresponds to the single-set partition b. □ 

Definition II. 2. // {1, . . . ,n} can be written as the disjoint union of two subsets Si and S2, we say the variables 
corresponding to these subsets are independent if 

(H'?!!'?) = (11'?) (11 9)' (15) 

S2 S[ S2 

for any subsets C Si. 

We now prove the characteristic property of cumulants: 

Proposition II. 3. The cumulant vanishes if its arguments can be divided into two independent subsets. 

Proof. For n — 2 this follows at once from ([TT|) and (fTS]). and we continue by induction. From (fT^ and the inductive 
assumption for n — 1, we have 

(gi...g„)^(gi...g„)^+ ^ lillil) E 11(11'?) • (l^) 

This holds because any term on the right-hand side of vanishes when any subset of the partition b includes 
elements of both Si and 5*2 . Using again, this implies 

{qi...qn) = {qi---qnr + {l[q){l[q), (17) 

Si S2 

and by independence, {qi ■ . ■ qnY — 0- Thus the inductive assumption holds for n. □ 



In fact, the coefficients au in ([9|) are uniquely determined to have the form ()10|) by the requirement that the cumulant 
vanishes when the variables form two independent subsets [T^ . [Tsj . 
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For n = 2, the cumulant (fTTj) is just the covariance, {qiq2Y = ((^i ^ ('?i))('?2 — (92))), and the same is true for 
n = 3, namely {qiq2q'iY — iili ~ {Qi)){Q2 — ('Z2))(93 — (93)))- For n — A, however, there is a surprise. The covariance 
is given by 

4 

{Y[{qi - {qi))) = (91929394) - ^{qiqjqk){qi) + ^{qiqj){qk){qi) - i{qi){q2){q3){q4), (18) 

where the sums include all distinct combinations of indices, but the cumulant is 

(91929394)'' = (91929394) - ^{qiqjqk){qi) - ^{qiqj) {qkqi) + '^^{qiqj){qk){qi) - 6((7i)(g2)(93)(94), (19) 

which includes terms like (9192) (9394) that do not occur in the covariance. Note that, if the subsets {1,2} and {3,4} 
are independent, the covariance does not vanish, since independence implies we can write the first term in (|18|) as 
(91929394) = (9192) (9394) and there is no cancelling term. However, as we have seen, the cumulant does contain such 
a term, and it is a pleasant exercise to check that the whole cumulant vanishes. 



III. SEQUENTIAL WEAK VALUES AND CUMULANTS 

To carry out a sequential weak measurement, one starts a system in an initial state then weakly couples 

pointers at several times during the evolution of the system, and finally post-selects the system state IV'/)- One 
then measures the pointers and finally takes the product of the values obtained from these pointer measurements. It 
is assumed that one can repeat the whole process many times to obtain the expectation of the product of pointer 
values. If one measures pointer positions qk, for instance, one can estimate {qi . . . (?„), but one could also measure the 
momenta of the pointers to estimate {pi . . -Pn)- 

If the coupling for the fcth pointer is given by Hint — S{t — tk)rkp, and if the individual initial pointer wavefunctions 
are gaussian, or, more generally, are real with zero mean, then it turns out ^] that these expectations can be expressed 
in terms of sequential weak values of order n or less. Here the sequential weak value of order n, (A„, . . .Ai)^, is 
defined by 



[An,... A,) 



{2Pf\Un+l...Ui\2P,) 



(20) 



where Ui defines the evolution of the system between the measurements of Ai_i and Ai. 
When the Ak are projectors, Ak = \xk){xk\, we can write the sequential weak value as 



{An,...Ai)^ 



|C^n|a:„-i) . . . {xi\Ui\ipi) 



amplitude (x) 



J^yii^fPn+llyn) (j/n | t^n Iz/n-l ) • ■ • (^1 1 C^l IV'i) amplitude(y) ^ 



(21) 



which shows that, in this case, the weak values has a natural interpretation as the amplitude for following the path 
defined by the Xk- Figure [5] shows an example taken from where the path (labelled by '1' and '2' successively) is a 
route taken by a photon through a pair of interferometers, starting by injecting the photon at the top left (with state 
and ending with post-selection by detection at the bottom right (with final state |V')/)- 
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In the last section, the cumulant was defined for expectations of products of variables. One can define the cumulant 
for other entities by formal analogy; for instance for density matrices [loj . or hypergraphs @. We can do the same 
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for sequential weak values, defining the cumulant by © with {flj^ q) replaced by (A{,^(|f,^.|), . . . Af,^.(i))t„, where the 
arrow indicates that the indices, which run over the subset 6j, are arranged in ascending order from right to left. For 
example, for n = 1, {A^Y = and for n — A 

(A4,A3,yl2,Ai)^ = (A4, A^,A2,Ai)^ ~ y"CA,,Aj,Ak)n,{Ai)n, - y"{A,,Aj)^(Ak,Ai)^ (22) 



+ 2^{A,,Aj)^{Ak)n,{Ai)^-Q{Ai)^{A2)n,{A:i)^{Ai)y,. 
There is a notion of independence that parallels (jlSp : given a disjoint partition Si\J S2 = {1, . . . , rt} such that 



(^s;us^)^ = (23) 

for any subsets 5^' C 5^, then we say the observables labelled by the two subsets are weakly independent. There is 
then an analogue of Lemma III. 31 

Lemma III.l. The cumulant (A„, . . . ,v4i)^ vanishes if the Ak are weakly independent for some subsets Si, 82- 

As an example of this, if one is given a bipartite system ® , and initial and final states that factorise as 
|'0i) = IV'i)"^® and — lip f)"^ C$\tp f)^ , then observables on the v4- and B-parts ofthe system are clearly weakly 
independent. Another class of examples comes from what one might describe as a "bottleneck" construction, where, 
at some point the evolution of the system is divided into two parts by a one-dimensional projector (the bottleneck) 
and its complement, and the post-selection excludes the complementary part. Then, if all the measurements before 
the projector belong to 5'i and all those after the projector belong to S2, the two sets are weakly independent. This 
follows because we can write 

{iJjf\Un+iAn . . . Uk+iAkWk\^b){tpb\ykAk-i ■ ■ ■ AiUi\xlJi) 



^ ^i^'^^^"'" {^f\Un+l . . . Uk+lWMii'blVk . . . Ui\tP^) 

^ (i>f\Un+iAn . . . Uk+iAkWMiAlVk ■ ■ ■ Ui\A) {^f\Un+i ■ ■ ■ Uk+iW^ (MykAk-i ■ ..AiUil^,) 

(^/|c/„+i . . . Uk+iWk\iJb){i^b\Vk . . . Ui\^^) • • ■ Uk+iWM{iJb\Vk . . . c/ilVz) 

= (^)^(^)», 

where Wk\4>b) {'ipb\Vk is the part of Uk lying in the post-selected subspace. As an illustration of this, suppose we 
add a connecting link (Figure [H "L") between the two interferometers in Figure [TJ so \il'b){''Pb\, the bottleneck, is 
the projection onto L, and post-selection discards the part of the wavefunction corresponding to the path L' . Then 
measurements at '1' and '2' are weakly independent; in fact (Ai)to = 1/2, (A2)w — 1/2 and (^2,^1)^, — 1/4. 
Note that the same measurements are not independent in the double interferometer of Figure [U where {Ai)^ — 0, 
(^2)^ = 0, and yet, surprisingly, (A2,Ai)„ = -1/2, [!]■ 
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IV. THE MAIN THEOREM 



Consider n system observables Ai, . . . , An- Suppose Sk, for k = 1, . . . ,n, are observables of the fcth pointer, namely 
Hermitian functions Sk{qk,Pk) of pointer position qk and momentum pk, and the interaction Hamiltonian for the weak 
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measurement of system observable Ak is Hk = QkSkAk, where gk is a small coupling constant (all gk being assumed 
of the same order of magnitude g). Suppose further that the pointer observables are measured after the coupling. 
Let 0fc be the fc-th pointer's initial wave-function. For any variable Xk associated to the fc-th pointer, write {xk)i for 

{4'k\xk\4>k}- 

We are now almost ready to state the main theorem, but first need to clarify the measurement procedure. When we 
evaluate expectations of products of the for different sets of pointers, for instance when we evaluate {rir2), we have 
a choice. We could either couple the entire set of n pointers and then select the data for pointers 1 and 2 to get (rir2). 
Or we could carry out an experiment in which we couple just pointers 1 and 2 to give (rir2). These procedures give 
different answers. For instance, if we couple three pointers and measure pointers 1 and 2 to get (rir2), in addition 
to the terms in gi, g2 and gig2 we also get terms in 3253 and gigz involving the observable A3. This means we get 
a different cumulant (ri . . .r^Y^ depending on the procedure used. In what follows, we regard each expectation as 
being evaluated in a separate experiment, with only the relevant pointers coupled. It will be shown elsewhere that, 
with the alternative definition, the theorem still holds but with a different value of the constant ^. 

Theorem IV. 1 (Cumulant theorem). For n>2, for any pointer observables rk and Sk, and J or any initial pointer 
wavef unctions (j)k, up to total order n in the gk, 

in... r^Y = 51 ■ • ■ 9nRe {C(A„, . . . , AiYJ , (24) 
where ^ (sometimes written more explicitly as ^ri...r„j is given by 

(n n \ 

l[{rkSk)^-l[{rk)^{sk)^] ■ (25) 
fe=l k=l J 

For n — I the same result holds, but with the extra term {r)i: 

{r)^{r),+gRe{^A^). (26) 

Proof. We use the methods of [1] to calculate the expectations of products of pointer variables for sequential weak 
measurements. Let the initial and final states of the system be and \^f), respectively. Consider some subset 
b — {61, . . . , b^} of {1, ... , n}, with &i < ^'2 < • ■ ■ < ^k- The state of the system and the pointers 61, . . . , 6^ after the 
coupling of those pointers is 

^SM - C/n+i . . . C/b,+ie-^9'"=^'"=^''« f/fc, . . . e-^f"! ^"i^-.! Ui,... /7i|V.,)0fc, {r^, )...<Pb^ (r^J, (27) 

and following post-selection by the system state \Y'f), the state of the pointers is 

= {iJs\Un+i ■ ■ ■Ub^+ie~'''-''-^''^Ub^ . . .e-'^^^'^^^^Wb, . . .U^\i}^) (28) 

Expanding each exponential, we have 

_ !^Mrb n.^Mdrb dr,^ ^ 

J a^'hi ■■■drt^ 

— fci 

_ n...i„gb ; 3i...j„gb "'i---'>''^3i---3t"i6ij!,i ■■■^ib^jb^ 
~ — 61 f)^ ' 

2^11. ..i„eb ; Ji...i„e& °^ii---i^°^3i---j^^ibi]bi ■■■'"ib^Jb^ 

where ik > are integers, ii, . . . , z„ G & means that ii — Q for I ^ 6, and 



(30) 




■ (31) 



U 



L= I [ni\)-\~isky^c^k{rk)rk{l\r\-isk)'<t>k{rk)drk, (32) 



/ (m!)-l(-^Sfe)™</.fc(rfe)(/!)-l(-^Sfc)Vfc(rfc)drfe. (33) 



Let us write (1501) as 



{rb^...rbY) = ^ , (34) 
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where 



= au...,„aji...j„u^^^^^.^^ . . . u'l;;^j^^ , (35) 



yk;i = afci...fc„a/i...i„f4j,^ • • • ' (36) 



and i denotes the index set {ii . . . in}, etc.. Define 



iefcjeb iebjefc 



Then 



(ri,...,r„y= ^ (fc-l)!(-l)'=-in(^f.,(i)---^^',(hl)) (38) 

bi,...,bk 1=1 

= y: (k-m-ir^ii^. (39) 

bi,...,bk 1=1 

Set y — rihcfi n} where b in the product ranges over all distinct subsets of the integers {!,..., n}. Then 
y{ri . . . TnY is an (infinite) weighted sum of terms 



where 



Zl — (a;i(i);j(i) . . . 2:i(m);j(m))(?/k(l);l(l) ■ • ■ 2/k(m');l(m') ) ' (^'^) 

UZj UZfc Ulz (41) 
= {i(l), . . . , i(m)} U {j(l), . . . , j(m)} U {k(l), . . . , k(m')} U {1(1), . . . , l(m')} 

denotes the set of all the index sets that occur in zx- The strategy is to show that, when the size of the index set 
I is less than n, the coefficient of zx vanishes; by this implies that all coefficients of order less than n in g 
vanish. We then look at the index sets of size n, corresponding to terms of order g", and show that the relevant 
terms sum up to the right-hand side of p4|) . But if y{ri . . .rnY = 5"^ + 0(5""'"^) for some x, then we also have 
(ri . . . r^Y = 5"a; + 0(5"+^), since 3^ = 1 + 0(5). 
Let 6 = {61, . . . , 5s} be a partition of {1, . . . , n}. We say that 5 is a valid partition for X if 

(i) For each r with 1 < r < m, i(r) + j(r) e 5/, for some 6;, and we can associate a distinct hi to each r. (Here i + j 
means the index set {ii + ji, ■ ■ ■ in + jn} ■) 

(ii) For each r with 1 < r < m', k(r) + l(r) G S", for some subset S C {1, . . . , n} that is not in the partition 6, i.e. 
for which S ^ bi for any I, and we can associate a distinct to each r. Let 7(1, b) be the number of ways of 
associating a subset S to each r. 

Lemma IV. 2. The coefficient of zx in y{ri . . . rn)"^ is zero if all the index sets in X have a zero at some position r. 

Proof. If we expand y{ri . . . rnY using (|39p . each term in this expansion is associated with a partition b of {1, . . . , n}. 
Let & be a valid partition for I, and let c = {ci, . . . , Cs} denote the partition derived from b by removing r from the 
subset 6; that contains it, and deleting that subset if it contains only r. Then the following partitions include b and 
are all valid : 



c 



(1) - 



{(rci),C2,...,cJ (42) 
c^^^ = {ci, {rc2), . . . ,Cs} 

c^"^ ={ci,C2,...,(rCs)} 

= {r,ci,C2, . . . ,Cs}. 

Each partition c^*^ for 1 < z < s + 1 contributes j{X,b) to the coefficient of zj in 3^ nf=i ^c(>) /i^c(») , and since this 
term has coefficient (s — 1)!(— l)'**^^' in (|39p for partitions c^^^, c'-^-', . . . c'*^ and s!(— I)'' for c*^^"''^-', the sum of all 
contributions is zero. □ 
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From equations ([3T|) and (|4T|) . the power of g in the term zj is |/| = + + + |/;|. This, together with the 
preceding Lemma, impHes that the lowest order non- vanishing terms in y{ri . . . are zj's that have a '1' occurring 
once and once only in each position; we call these complete lowest-degree terms. 

Lemma IV. 3. The coejjicient of a complete lowest-degree term zj in y{ri . . . rnY is zero unless only one of the four 
classes of indices in I, viz. li, Ij, Ik orTi, has non- zero terms. 

Proof. Consider first the case where the indices in Tj and Ti are zero, and where both Xi and have some non-zero 
indices. Let b = {&i, . . . ,hr} be the partition whose subsets consists of the non-zero positions in index sets '\{t) in 
Xi, and let c — {ci, . . . , Cs} be some partition of the remaining integers in {1, . . . , n}. Suppose s < r. Then we can 
construct a set of partitions by mixing b and c; these have the form 

d^"^ -K,...,c,„(.Ti6i),...,(x,6,)}, (43) 

where each Xi is either empty or consists of some Ci, and all the subsets Ci are present once only in the partition. If 
any d'^™-' is eligible, all the other mixtures will also be eligible. Furthermore, the set of all eligible partitions can be 
decomposed into non-overlapping subsets of mixtures obtained in this way. 

Any mixture c?*^™^ gives the same value of 7(1, c?*^™^), which we denote simply by 7; so to show that all the contri- 
butions to the coefficient of zj cancel, we have only to sum over all the mixtures, weighting a partition with t subsets 
by {t - This gives 

Coefficient of zj = 7^" (s + r - I)!(-f )''+^-' f (''\ il 



i=Q 



I J \ l 



= ^{-iy+^-^s\ ^(,s + r - z - 1) . . . (s - z + I) Q 

The above argument applies equally well to the situation where Ti and Ti both have some non-zero indices and 
indices in Tj and T^ are zero. If the non-zero indices are present in Ti and Tj, we can take any eligible partition 
a = {fli, . . . , flj.} and divide each subset into two subsets bk and Cfc with the indices from Ti in bk and those from 
Tj in Cfc. All the mixtures of type ()43p are eligible, and they include the original partition a. By the above argument, 
the coefficients of zx arising from them sum to zero. Other combinations of indices are dealt with similarly. 

Note that, for n = A and for the index sets (1, 1, 0, 0) G Ti and (0, 0, 1, 1) £ X,-, the "mixture" argument shows that 
coefficient of zj coming from {rir2rzr4) cancels that coming from (r'ir2)(r3r4) to give zero. This cancellation occurs 
with the cumulant (|19|) . but not with the covariance (|I8p . where the term {rir2){r^r4) is absent. 

□ 

The only terms that need to be considered, therefore, are complete lowest-degree terms with non-zero indices only 
in one of the sets 1^, Tj, Tk and Z;. It is easy to calculate the coefficients one gets for such terms. Consider the case 
of Xi. We only need to consider the single partition b whose subsets are the index sets of Ti. For this partition, by 
(gni), (ESI) and 



zx = W_ai(e) n^to^^O.O = ffl ■ •■5« n (^i(e)(|i(e)|)> - •■ > A(e)(l))^ n^'"'^*'^^^ 
e=l k=l e=l k=l 

From (|39p . zx appears in y{ri . . . rnY with a coefficient [t — 1)!(— 1)*^^. So, summing over all zx with indices in Ti, 
one obtains g\ . . . (7„(^„, . . . ,A\f^ nfc=i(~*(''fe'5fe)i)- Similarly, from PT|) . (I32p and summing over the zx with 

indices in Tj gives the complex conjugate of gi . . . (7„(^„, . . . , A\fw nfe=i(^*(^fe'^fe)i)- Thus Ti and Tj together give 
51 ■ • ■ 5«(2 VCk=i{-^{rkSk)i))Re • ■ • , 

This corresponds to ([24|) . but with only the first half of ^ as defined by (|25|) . The rest of ^ comes from the index sets 
Tk and X/. However, the sum of the coefficients of zx for the same index set in X; and Tk is zero. This is true because, 
for any complete lowest degree index set, the sum of coefficients for all zx with the indices divided in any manner 
between Ti and Tk is zero, being the number ways of obtaining that index set from y times X]"=i(^ ^ But 
by Lemma riV.3[ the coefficient of zx is zero unless the index set comes wholly from Xj or Tk. Now ([55]) and ([55]) 
tell us that, for an index set in Tk, 

t n t n 

ZX = n^Ue) n^O.O^to = 9l---9nY[ (A(e)(|i(e)|), - ■• , A(e)(l))^ H ^'"'^ ) * ^'^fe) * ' 
e=l fc=l e=l k=l 
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and from the above argument, this appears appears in y{ri . . .VnY with coefScient —{t — 1)!(— Again, the 
index sets in Ti give the complex conjugate of those in Xk. Thus we obtain the remaining half of ^, which proves (|24p 
for n > 2. For n — \ the constant terms (of order zero in g) in y{r) do not vanish, but the proof goes through if we 
consider y{{r) — {r)i) instead. 

□ 

V. EXPLORING THE THEOREM 

Consider first the simplest case, where n = 1 and r = q. We take Hint = gS(t)pA throughout this section, so s = p. 
Then ([26]) and ([25]) give 

{q) = {q)^ + gRei^M with ^, = -2i {{qp), - {q)^{p)^) , (46) 
which we have already seen as equations @ and ([S]). If we measure the pointer momentum, so r = p, we find 

{p) = {p)^ + gRei^pA^) with = -2»((/). - (p)-), (47) 

which is equivalent to the result obtained in 

For two variables, our theorem for ri — qi,r2 = q2, is 

{qiq2r = 5i32i?e(eg,(^2, Ai)^), (48) 

with 

iqq '2{{qi)i{Pl)t{q2)i{P2)t " {qiPl) i{q2P2) i) ■ (49) 

The calculations in the Appendix allow one to check ((48|) and (|49|) by explicit evaluation; see (|A3|1 . Note in passing 
that, if one writes Ag = {{qi — (qi))^), the Cauchy-Schwarz inequality 

{{qiq2rr = {((91 - ('Zi))fe - {q2))V < ((-Zi ^ ('Zi))')((92 - (92))') 
implies a Heisenberg-type inequality 

AqiAqa > gi.92i?e{e,,(A2,^i)^}, 

relating the pointer noise distributions of two weak measurements carried out at different times during the evolution 
of the system. 

When one or both of the qk in (I48|) is replaced by the pointer momentum pk , we get 

{qiP2Y = 9ig2Re i^qpiA2,AiYJ , (50) 

{piP2y = 9ig2Re i^ppiA2,Aiyj , (51) 

with 

^,p = -2 {{qlPl)M)^ - {ql}^{Pl)^{P2)l) , (52) 

^pp^-2{(plUplh-(p^)^{p2)l). (53) 

Consider now the special case where is real with zero mean. Then the very complicated expression for {qiq2) in 
(|Aip reduces to 

(gi<Z2) - ^ i?e [{A2,Ai)^, + (Ai)„(A2)^] , (54) 
as shown in Two further examples from are 

(919293) = Re[{A3,A2,Al)^ + {A3,A2)UAl)^, + iA3,Al)^{{A2)^, + {A2,Al)UA3U] , (55) 

(91929394) = ^i^^^ i?e[(A4,A3,A2,Ai)^ + (yl4,A3,^2)«,(Ai)«- + ... + (A4,A3)^(A;;^)^, + ...] . (56) 
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We can use these formulae to calculate the cumulant {qi . . . Qn), and thus check Theorem II V . 1 f or this special class of 
wavefunctions 4>. Each formula contains on the right-hand side a leading sequential weak value, but there are also 
extra terms, such as {Ai)jjj{A2)w in ([54| and (yl2, Ai)u,(j43)m in ([55| . All these extra terms are eliminated when the 
cumulant is calculated, and we are left with ([M)) with ^gi...g„ = (1/2)"^^. 

This gratifying simplification depends on the fact that the cumulant is a sum over all partitions. For instance, 
it does not occur if one uses the covariance instead of the cumulant. To sec this, look at the case n ^ 4: The 
term {qiq2q3q4:) in Cov{qi,q2,q3,qi), the covariance of pointer positions, gives rise via ([56|) to weak value terms like 
{A4, A3)yj{A2, Ai)^. However, together with ([5^ . ([55)1 and ([55)1 show that Cof (gi, 92, 93, 94) has no other terms 
that generate any multiple of (^4, ^3)^,(712, ^1)^, and consequently this weak value expression cannot be cancelled 
and must be present in Cow (gi, (72, 93 , 94) • This means that there cannot be any equation relating Cow(gi, 92, ?3, 94) 
and Cof (A4, A3, A2, This negative conclusion does not apply to the cumulant (q'i929394)'^j as this includes 
terms such as (gi(?2)(93?4); see 



VI. SIMULTANEOUS WEAK MEASUREMENT 



We have treated the interactions between each pointer and the system individually, the Hamiltonian for the fc'th 
pointer and system being H^. — gkS{t — ti;)s^.Ak, but of course we can equivalently describe the interaction between 
all the pointers and the system by -ff = gkS{t — tk)3kAk. For sequential measurements we implicitly assume that 
all the times tk are distinct. However, the limiting case where there is no evolution between coup ling of the pointers 
and all the tfc's are equal is of interest, and is the simultaneous weak measurement considered in jl4i [isl. [l6j. In this 
case, the state of the pointers after post-selection is given by 

"fM = {i^f\e-'^''''^'■■■+'-'-''-^\i^^)Mn) ■ ■ - Mrn). (57) 

The exponential e"*^^^*'^"^! ""'"^"*""*"'' here differs from the sequential expression e~'^"^"^" . . . e~*^i''i'^i in P5|) in 
that each term in the expansion of the latter appears with the operators in a specific order, viz. the arrow order 
^ as in (|22p . whereas in the expansion of the former the same term is replaced by a symmetrised sum over all 
orderings of operators. For instance, for arbitrary operators X , Y and Z, the third degree terms in e-^ include 
X3/3!, X^Y/2\ and XYZ, whose counterparts in e(^+^+^) are, respectively, X^/3\, {X^Y + XYX + YX^}/3\ and 
{XYZ + XZY + YXZ + YZX + ZXY + ZYX}/3\. Apart from this symmetrisation, the calculations in Section Hv] 
can be carried through unch ang ed for simultaneous measurement. Thus if we replace the sequential weak value by 
the simultaneous weak value [ij, [l^, [lB| 

(Ai^, . . . , Aijtus = — ^ (Ai^jj^j , . . . , Ai^jjj)^ , (58) 

where the sum on the right-hand side includes all possible orders of applying the operators, we obtain a version of 
Theorem IIV. 1 1 for simultaneous weak measurement: 

(n . . . r„y = 51 • • • 9nRe {^(A„, . . . , Ai)^ J . (59) 

Likewise, relations such (|54p . ([55]) . etc., hold with simultaneous weak values in place of the sequential weak values; 
indeed, these relations were first proved for simultaneous measurement [l3, [l^ ■ 




FIG. 3: 

From (j58p we see that, when the operators Afc all commute, the sequential and simultaneous weak values coincide. 
One important instance of this arises when the operators A^ are applied to distinct subsystems, as in the case of the 
simultaneous weak measurements of the electron and positron in Hardy's paradox [13, [3 ■ 
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When the operators do not commute, the meaning of simultaneous weak measurement is not so obvious. One 
possible physical interpretation follows from the well-known formula 

e^+^ = hm (e^/^e^/^)^ (60) 

AT— »oo 

and its analogues for more operators. Suppose two pointers, one for Ai and one for A2, are coupled alternately in 
a sequence of TV short intervals (Figure [31 top diagram) with coupling strength gk/N for each interval. This is an 
enlarged sense of sequential weak measurement [2\ in which the same pointer is used repeatedly, coherently preserving 
its state between couplings. The state after post-selection is 

*M - (^/l [e-<^>''''e-<^>^''^Y \^,)Mri)Mr2). (61) 

From ([60]) we deduce that 

■^M « {'4>f\e-'^''''^'^''''^'^\i^.)Mri)Mr2). (62) 

This picture readily extends to more operators A^.. 

One can also simulate a simultaneous measurement by averaging the results of a set of sequential measurements 
with the operators in all orders; in effect, one carries out a set of experiments that implement the averaging in (j58p . 
There is then no single act that counts as simultaneous measurement, but weak measurement in any case relies on 
averaging many repeats of experiments in order to extract the signal from the noise. In a certain sense, therefore, 
sequential measurement includes and extends the concept of simultaneous measurement. However, if we wish to 
accomplish simultaneous measurement in a single act, then we need a broader concept of weak measurement where 
pointers can be re-used; indeed, we can go further, and consider generalised weak coupling between one time-evolving 
system and another, followed by measurement of the second system. However, even in this case, the measurement 
results can be expressed algebraically in terms of the sequential weak values of the first system . 



VII. LOWERING OPERATORS 



Lundeen and Resch ^16i] showed that, for a gaussian initial pointer wavefunction, if one defines an operator a by 

/ 2\l/2 f , iP \ 



then the relationship 



(aLR) = g(p'^)y^An 



holds. They argued that a^n can be interpreted physically as a lowering operator, carrying the pointer from its first 
excited state |1), in number state notation, to the gaussian state [0) (despite the fact that the pointer is not actually 
in a harmonic potential). Although ola is not an observable, (aLR) can be regarded as a prescription for combining 
expecations of pointer position and momentum to get the weak value. 
If instead of qlr one takes 

~ (63) 



then the even simpler relationship 

(a) = gA^, (64) 

holds. We refer to a as a generalised lowering operator. 

Lundeen and Resch also extended their lowering operator concept to simultaneous weak measurement of several 
observables Ak . Rephrased in terms of our generalised lowering operators at defined by (|63|) , their finding (l6l | can be 
stated as 

(ai . . . a„) = 51 . . . gn{Ai . . . An)^^- (65) 
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This is of interest for two reasons. First, the entire simultaneous weak value appears on the right-hand side, not just 
its real part; and second, the "extra terms" in the simultaneous analogues of ([54| . ((55|) and ([56| have disappeared. 
The lowering operator seems to relate directly to weak values. 

We can generalise these ideas in two ways. First, we extend them from simultaneous to sequential weak measure- 
ments. Secondly, instead of assuming the initial pointer wavefunction is a gaussian, we allow it be arbitrary; we do 
this by defining a generalised lowering operator 

a^q + i — , with V = —i£,v/^a- (66) 
77 ^ ^ 

For a gaussian 0, = 2(p^)i, so the above definition reduces to ([55)1 in this case. In general, however, (/> will not 
be annihilated by a and is therefore not the number state |0) (this state is a gaussian with complex variance 'r]~^). 
Nonetheless, there is an analogue of Theorem lIV.il in which the whole sequential weak value, rather than its real part, 
appears: 

Theorem VII. 1 (Cumulant theorem for lowering operators). For n > 1 

(ai . . . an)" ^ gi-.-Qn '& {An, ■ ■ ■ ^i)^, (67) 

where 'd is given by 

E £ ■ ^''^ 

(ii,...i„)G{0,l}" ■ ■ ' ^P"' 

For n — I the same result holds, but with the extra term (a)^; 

(a) ^ {a),+^gA^. (69) 

Proof. Put ro = q, J'l = p. Then 

(ai . ..OnY = {{qi + ipi/Vl) ■ ■ ■ iln + ipn/nn))", 

(ii,...4„)e{o,i}" fei ■ ■ '^P") 

= 5i...5„ [&{An,...Ai)l + w{An,...AiYJ\ , 



where we used Theorem IIV. 1 1 to get the last line, and where d is given by (l68|) and m by 

„ (-1)^'^C.,.....„ 

^ 27! T~) ' 

(note the bar over <^^.^ ^. that is absent in the definition of d by 

We want to prove = 0, and to do this it suffices to prove that the complex conjugate of the numerator is zero, 
i.e. 

(ii,...i„)e{o,i}" 

Let Ofc = {qkSk)i, bk = (qk)i{sk)i, Ck = {pkSk)i, dk = {pk)i{sk)i- Using the definition of £ in (gS]), the above equation 
can be written 



n77(2"+i(-l)") = J] {okick - dk) - Cfc(afe - bk)} - [] {^^(c/c - 4) - 4-(afe - bk)} 
fe=i fe=i 

= n(&feCfe - akdk) - J|(fofcCfc - Okdk) = 0. 



□ 
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Suppose the interaction Hamiltonian has the standard von Neumann form Hint = gpA, so s = p in the definition 
of £, by equation ([25]). Then for n = 1, since 1p = and (qp)- = {pq)i, •& = {-i){£,q - C,) = {-i){{qp)i - (pq)i) = 1, 
so we get the even simpler result 

(a) ^ {a)^+gA^. (70) 

This is valid for all initial pointer wavefunctions, and therefore extends Lundeen and Resch's equation (j64p . It seems 
almost too simple: there is no factor corresponding to ^ in equation (|46p . However, a dependency on the initial pointer 
wavefunction is of course built into the definition of a through rj. 

For 71 > 1 it is no longer true that "i? = 1, even with the standard interaction Hamiltonian. However, if in addition 
= 0, then 

n 

fe=i 

Thus (ai . . . Qn)'' — gi . . . gn{An, . . . Ai)'^ for all n. Applying the inverse operation for the cumulant, given by Propos- 
tion lH.H we deduce: 

Corollary VII. 2. // (p)i — 0, 6.17. if the initial pointer wavefunction <f> is real, then for n > 1 

(ai . . .On) = 51 ■ ■ -gniAn, ■ ■ -Ai)^. (71) 

This is the sequential weak value version of the result for simultaneous measurements, (|65|) . but is more general 
than the gaussian case treated in [l6l |. 

We might be tempted to try to repeat the above argument for pointer positions instead of the lowering operators 
flfc by applying the anti-cumulant to both sides of (pi)) . This fails, however, because of the need to take the real part 
of the weak values; in fact, this is one way of seeing where the extra terms come from in (j54p . (jSSp and (j56p and their 
higher analogues. 

Note also that ()7ip does not hold for general 0, since then different subsets of indices may have different values of 

7?. 



VIII. DISCUSSION 



The procedure for sequential weak measurement involves coupling pointers at several stages during the evolution 
of the system, measuring the position (or some other observable) of each pointer, and then multiplying the measured 
values together. In it was argued that we would really like to measure the product of the values of the operators 
Ai, . . .An, and that this corresponds to the sequential weak value [An, . . . ,^i)to. Multiplication of the values of 
pointer observables is the best we can do to achieve this goal. However, this brings along extra terms, such as 
(Al)»(A2) w in (|54p ■ which are an artefact of this method of extracting information. From this perspective, the 
cumulant extracts the information we really want. 

In a somewhat idealised measuring device was being considered, where the pointer position distribution is real 
and has zero mean. When the pointer distribution is allowed to be arbitrary, the expressions for {qi . . . qn) become 
wildly complicated (see for instance (jAip ). Yet the cumulant of these terms condenses into the succinct equation ([M)) 
with all the complexity hidden away in the one number ^. Why does the cumulant have this property? 

Recall that the cumulant vanishes when its variables belong to two independent sets. The product of the pointer 
positions gi, . . . will include terms that come from products of disjoint subsets of these pointer positions, and the 
cumulant of these terms will be sent to zero, by Lemma III.3I For instance, with n = 2, the pointers are deflected in 
proportion to their individual weak values, according to ([4]), and the cumulant subtracts this component leaving only 
the component that arises from the 0(5^)-infiuence of the weak measurement of Ai on that of The subtraction 
of this component corresponds to the subtraction of the term {Ai)w{A2)w from (f54|) . In general, the cumulant of 
pointer positions singles out the maximal correlation involving all the qi , and the theorem tells us that this is directly 
related to the corresponding "maximal correlation" of sequential weak values, (A„, . . . ,AiY, which involves all the 
operators. 

In fact, the theorem tells us something stronger: that it does not matter what pointer observable r{p, q) we measure, 
e.g. position, momentum, or some Hermitian combination of them, and that likewise the coupling of the pointer with 
the system can be via a Hamiltonian Hint — gs{p, q)A with any Hermitian s(p, q). Different choices of r and s lead 
only to a different multiplicative constant ^ in front of (A„, . . . , Ai)^ in ([M]) . We always extract the same function of 
sequential weak values, {Am . . . , ^i)^, from the system. This argues both for the fundamental character of sequential 
weak values and also for the key role played by their cumulants. 
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APPENDIX A: AN EXPLICIT CALCULATION 



To calculate ((71(72) for arbitrary pointer wavefunctions and (j)2, we use (j28p to determine the state of the two 
pointers after the weak interaction, and then evaluate the expectation using (|29p . keeping only terms up to order g^. 
We define 

Mfc = {qk)t, = {pk)t, Ck = {pI)i, 

Pk = {qkPk)i, cTfc = {qkpl)i, Tk = {pkqkPk)i, 

Then, expanding the exponential in (I^S)) and substituting 5' in (I^S)) gives, up to order g^, 

{qiq2) = - igi {{{Ai)^ - {Ai)^) p,iviH2 - (^i)«,piAi2 + (^i)tuPiA*2} (Ai) 

- ig2 {{{A2)w - {A2)w) MlAi2i^2 - {A2)wP-lP2 + {A2)wPlP2} 

+ gl ||(^i)^P(TiAi2 - MiCiAi2) + {{Al)^ + 

- 5? {((^1)^ - {Ar)^Ypi^lp2 + [AD^^ + {Al\ 
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+ gl\\{A2U''{pir2 - P1P2C2) + {{AlU + (^2)») ^^^} 

- 9i{{iA2U - {A^)2f PiP2i^l + (AlU^ + {AlU^} 

+ 51^2 |(^l)tu(^2)to/Ol/02 + iAi)n,{A2)wPlP2 ~ (^2 , )„,piP2 " (^2 , ^1 )u,PlP2 | 

- gig2 {2 ((Ai)^ - (Ai)^) {{A2)ni ~ (^2)«)) A*l'^lM2J^2} 

+ 5132 I ((^2, ^l)™ + (^2, Ai)^ - iAi)yj{A2)yj - iAi)^,{A2)ro'j PliyiP2'^2^ 
+ gl{ ((Ai)^ - (Ai)^) {Ai)^ViPi^2 - {{Ai)nj - (Ai)^) (Ai)^Z^iPi^2} 
+ 52 { {{A2)w - {A2)w) {A2)wPlV2P2 ^ ((^2)iu ^ (^2)u.) (^2)«,A^i;^2P2 } 
+ 5l52 {{{Aljw - {Aijn}) {A2)wPlVlP2 ^ " (^2 ) i^l/02 } 

+ 5l52 {((^2)«, - {A2)w) {Ai)y,piP2V2 - {{A2)w " {M)^) {Ai)wPlP2V2] ■ 

To calculate the cumulant ((71,(72)^ — ((71(72} — ((?i)((72) we need {q) up to order g^: 

{q) = p + ig {A^^ipiy - p) - A^{niy - p)} + g'^\A^\'^ {t - pQ + 2piy^ - up- vp) (A2) 



^2 2) ' \2 2^ 
Substituting from (jAip and (jA2p a radical simplification occurs: 

{qiq2y = 5152 {(^2,^1)^ - {Ai)^{A2)ni} {piyiP2V2 - P1P2) + complex conjugate. (A3) 



This, of course, is what Theorem IIV. 1 1 tells us. 
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